added why.md for the environments#23
added why.md for the environments#23yogesh1801 wants to merge 98 commits intosys-intelligence:mainfrom
Conversation
…stinguish-api-keys Distinguish the models used in the executor and evaluator
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
Signed-off-by: Tarek <tareknaser360@gmail.com>
…s/sysmobench/sysmobench_core'
- Add gpt-4o model configuration to models.yaml - Fix setup_tools.py to use shutil.move instead of os.rename This resolves 'Invalid cross-device link' error when /tmp is on different filesystem
…rse_lab_bench Course Lab Benchmark: Add Instructions for Extending the Benchmark
…benchmark Adding ArtEvalBench v0.9
…adding new artifacts to ArtEvalBench
Co-authored-by: Tarek Elsayed <60650661+tareknaser@users.noreply.github.com>
Co-authored-by: Tarek Elsayed <60650661+tareknaser@users.noreply.github.com>
Improving the "contributor's guide" and simplifying the benchmark's schema
bastoica
left a comment
There was a problem hiding this comment.
I'm a bit confused as why you added a new WHY.md for arteval_bench. Is this simply the existing file or did you make any edits? Thanks!
|
Hi @bastoica sorry for the confusion the arteval benchmark file is same, it is error from my side that it looks like a new file in commit, but it is the same |
Signed-off-by: Yogesh <yogeshsingla481@gmail.com>
|
sounds good, thanks @yogesh1801 |
|
@tareknaser can you help review if the new WHY.md make sense to you? |
|
@Qian-Cheng-nju can you help review if the new WHY.md of SysMoBench works for you? Welcome any comments. |
Updated the number of systems and their types in the benchmark description.
We recently added the ringbuffer module from Asterinas and ZooKeeper, so I updated the description of the number and types of systems. Everything looks good to me now. Thank you very much for such a detailed document! |
Description
This PR addresses Issue #22 by adding dedicated WHY.md files to each benchmark directory and linking them from the root README. These files explain why each benchmark matters and how it fits into the broader vision of system intelligence, following the pattern established in PR #21.
Changes
Testing
Checklist